CDS

Accession Number TCMCG021C36737
gbkey CDS
Protein Id XP_029117777.1
Location join(89796..89833,92239..92332,93351..95644,97171..97302,97470..97542,97851..97890,98155..98227,99291..99356,102643..102720,105407..105579,118979..119043,119170..119413,119498..119793)
Gene LOC105035387
GeneID 105035387
Organism Elaeis guineensis
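
The Location field above can be cross-checked against the protein length reported in this record (1221 aa). The following is a minimal sketch, not part of the original entry: the helper name `cds_length` is hypothetical, and it assumes the coordinates are 1-based and inclusive (the usual feature-table convention) with no complement or partial-end markers.

```python
import re

# Location string copied from the CDS block of this record.
LOCATION = (
    "join(89796..89833,92239..92332,93351..95644,97171..97302,"
    "97470..97542,97851..97890,98155..98227,99291..99356,"
    "102643..102720,105407..105579,118979..119043,"
    "119170..119413,119498..119793)"
)


def cds_length(location: str) -> int:
    """Sum the lengths of all start..end segments in a join(...) location.

    Assumes 1-based inclusive coordinates, so each segment spans
    end - start + 1 nucleotides.
    """
    segments = re.findall(r"(\d+)\.\.(\d+)", location)
    return sum(int(end) - int(start) + 1 for start, end in segments)


total_nt = cds_length(LOCATION)
codons = total_nt // 3

print(total_nt)    # 3666 nucleotides across 13 segments
print(codons - 1)  # 1221 residues once the stop codon is excluded
```

Under these assumptions the 13 segments sum to 3666 nt, that is 1222 codons, which matches 1221 amino acids plus one stop codon.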

Protein

Length 1221aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_029261944.1
Definition uncharacterized protein LOC105035387 isoform X5 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category L
Description DNA binding domain with preference for A/T rich regions
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03021
KEGG_ko ko:K15200
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGGACAACTGCCTCGTAAAACAAGAAATTCAGACAGCAATGATTTTGTCATGCATGCTGGTGGGCATGTCTGGGGATTGGACTGGTGTCCCCAGATCCATGAAAAGCCTCATTCTAATATCAAATGTGAGTATCTTGCAGTTGCTGCTCACCCTCCTGGTTCTACATATCACAAGATAGGTGTCCCATTGATTGGAAGAGGTGCCATTCAAATCTGGTGCCTCTTAACTTTGGATGAGAAAGTGGAATTTTCTCTACCAAGGTCCAAAAGAGGGAGACCTAAAAAGGAACCCGTTAAAGAAGAACCACTAAATGATTTTAATGGTACAGGTATGGCAATAACCGCCAAAAGGTCCAGAGGCAGACCAAGGAAAAGGCCAGTTGAAAATGATACAAAATATGCTTTAAGTGTGGAAGATGGATCAGATCTACCAAGCCCAAGTTGGAGACCTAAAAAGAGACTGATGCTGGGTGTAGTTGATCTAAATGGTTCAAAGAAATTATCTCCAGCAAAGCCTAGAGGGAGACCCAGGAAAAAACCAACTTCTGATAACAACAGTGTACAAAAATCTTTTCTTGCCAAGCCCAGAGGGAGACCTAGAAAACATTCGCCTCCAAGCATTGATAATTCAAATGACAAAGATGTCTCACGTCCTTGCAGTAACAATCAAATTCAGATTGTAAGTGAGTCTAATGTGTGTACAACTGTTAATTCAGGGAATAATGTAATGGCATTGTCCTTTTCTGCTGATGTAAATTGTGGGGAGGTAACAATTCAAAAAAGGTGCAGAGGAAGACCCAGAAAGAATTCCATTTCAACTGTAAATGATCATGTTCCAGAATCTGGGGTTGAATCAGGGAATGGTACATCTTTTTTGGCCACTTCAAGCAGATCTGAGACTTTGGACATGAATGAATCATTTTTATGCAGTAACAATGAGATTCAAAGTGCTGTTGATTTGGGGAATATTGCAGTGGCATCCCCTGTTTCTGCTGATGTAAATTGTGAGGAGGCAACAATTCAAAAGAGGTGCAGAGGAAGACCTAGAAAAAACTCCATTTCAAATGTAAATGAACACATTCCAGCATCTGGTGTTGAATCAGTGCATTGTACATCTTCCTTGGTCACTGCAACCAGACCTGAGACTTTGAACATGAATGAATCATGTTCATGCAGTAACAATCAGATTCCAAGTGCTGTTGATTTGGGGAATATTGCATTGGCATTGCCTGTTTCTGCCGATGTAAACTTTGAGGAGGGAATGATTCAAAAAAGATGCAGAGGAAGACCTAGAAAGAACTCCATTTCAGATGCAAATGAACACATTCCAACATCTGGTGTTGAATCAGGAAATGGTACATCTTCCTTGGCCACTTCAACCAGACCTGAGACTTTGAACATGAAAGGATCATTTTTATGCAGTCACAATCAGATTCTAAGTGCTCTTGATTCGGGGAATACTGCATTGCCATCTCCTGTTTCTGCTGATGTAAATTGTGAGGAGGGATCAATTGAAAGAAGATGCGAAGAAAACATCTCAAGTGTAAATGAATGTGTTCCAGCATCTGCTGTTGTATTAGGGAATGGTACATCTTCCTTAGCCACTCCAAGCAGATCTGAGACTTTGAACATTAATGAATCATGTCTATGCTGTAACAGTCAAATCCGAAGCACTGGTGAGTGTGTTCTGCATTTAACTGTCGAATCGGGGAATGCTGCATCAGCCTTACCTGTTTCTGCTGATGCACACTGTAATGAGGGAACGTGTCCTCCAAGGCGTAGAGGGCGACCTCGAAAGAGGCCACTTCCAACTATAAACAAGTGTGTTATGGCATCCGGTGTTGAATCAGGGAATGATGTATCTGTATTGCCAACTTGTAGCAGACCTGGTATTTCCAGTGTAGACAAATCACCTCTATTTAGTAATAGTCAAACTCTAAATGGAAGTGAGGGTTTTCTTCCTTGTGATCCAGGGAATTTTGGATTGGCATCATCTGA
TTCTGTTGATGTAAATTGTAAGGTGGATACAATTCAACAAAGGCACAGAGGGAGACATAGAAAGCAGCTAGTTTTAAGCTTGAACAAATGTTTTCTGGAATCTGGAGTTGAATCAGTGGATGATACATTAGCATTGCCCACTTCTAGAGGACCTGAGACATTGGATGTAGTTGAATCACCTCTGTACGGCAATTCTCAGGATGCGATGCTCTTAAGTAATGAAGCGGGCTGTGAGAGCTCATCTAAAGCTGACTTAACTAGTTTAATTCCAAGAGACATTGCTTTGCCCAGGGTTGTACTCTGTCTAGCTCACAATGGGAAAGTTGCATGGGATGTGAAATGGAGACCTTGCACCATCAACGATTCAGAAGGCATGCATCATATGGGTTATCTTGCTGTATTGTTGGGAAATGGTTCTCTGGAAGTGTGGGAAGTCCCAGCCCCTAGCATTGTCAAAGTTTTCTTTGCTTCTAGCTGCAGTGAGGGTACTGATCCTCGTTTTTTGAAATTGGAACCTGTATTCAGATGCTCAAAGGTGAAATGTGGAGATCGACACAGCATTCCTCTGACAATGGAGTGGTCACCTTCTGCCCCGCATGATCTAATATTAGCTGGATGCCATGATGGAACGGTTGCCTTGTGGAAGTTTGCTAAACAATATCCATCTCAAGATACAAAACCTTTACTTTGCGTCACGGCTGATTCTGCTCCTATAAGAGCACTTGCTTGGGCTCCAGAGGAAAGTGATAAGGAGAGTGCAAATCTTTTTGTGACTGCTGGACATGAAGGTTTAAAATTTTGGGACCTGCGTGATCCATACCGTCCGCTATGGGACTTGAATCCCACGCCAAGAGCAATTTTGAGCGTGGATTGGGTAAAACATCCTAGATGTATCGTCTTATCACTTGATGATGGAACCTTGAGGATCCTCAGCTTGTGGAAAGCAGCATATGATGTTCCTGTTACTGGAAGACCGTTTGCTGGAACAAAGTATCAAGGGCTGCATAACTTTGGCTGCTCATCTTTCGCCGTTTGGAGTGCCCAGGTGTCACGAACTCTAGGTCTCGTTGCTTATTGTAGTGCAGATGGATCTACAGTTCGGTTTCAGGTAACTGAACTATCTGATCTTACTGAAGCTGTGGACAAAGATCCAAAGCGAAACCCTAAACCGCATTTCCTTTGCGGGTCACTTATGGAAAAAGGCCAAGTTCTTGAGATCAATAGTCCGCTACCTGATGTTCCATTGCCCAACATTCCTTTTGTGCAAAAGAAGTCGGTTGATGACTGTGTGGACACTGCTCCGACCATGCAGTTGCATGGTTGCTTGTCAGATGTGGACCAGGCAAAACAAACAGGTCATGCTGTTTCAGGTAGTGAAGAAACAATGGGAAATACAACATCAAAATCCAGAAAGAATGAGAGGAAGAAACAGCATGCAAGTGCTATTGCTGTGCAAACAAAATTTCATGCTGAAATAGAGCAAGGGATATTGCAAAGAAACGAAAACAAAGATGAAGGATCTCCACAACAGTTTGAAGCACACCCTCCCAAAGTTGTGGCTATGCATAGGGTAAGGTGGAACATGAACAGAGGGAGTGAAAGATGGTTGTGTTACGGTGGAGCTGCAGGCATCATTCGATGTCAGCAAGTTTCTTTGCAAATGTAG
Protein:  
MGQLPRKTRNSDSNDFVMHAGGHVWGLDWCPQIHEKPHSNIKCEYLAVAAHPPGSTYHKIGVPLIGRGAIQIWCLLTLDEKVEFSLPRSKRGRPKKEPVKEEPLNDFNGTGMAITAKRSRGRPRKRPVENDTKYALSVEDGSDLPSPSWRPKKRLMLGVVDLNGSKKLSPAKPRGRPRKKPTSDNNSVQKSFLAKPRGRPRKHSPPSIDNSNDKDVSRPCSNNQIQIVSESNVCTTVNSGNNVMALSFSADVNCGEVTIQKRCRGRPRKNSISTVNDHVPESGVESGNGTSFLATSSRSETLDMNESFLCSNNEIQSAVDLGNIAVASPVSADVNCEEATIQKRCRGRPRKNSISNVNEHIPASGVESVHCTSSLVTATRPETLNMNESCSCSNNQIPSAVDLGNIALALPVSADVNFEEGMIQKRCRGRPRKNSISDANEHIPTSGVESGNGTSSLATSTRPETLNMKGSFLCSHNQILSALDSGNTALPSPVSADVNCEEGSIERRCEENISSVNECVPASAVVLGNGTSSLATPSRSETLNINESCLCCNSQIRSTGECVLHLTVESGNAASALPVSADAHCNEGTCPPRRRGRPRKRPLPTINKCVMASGVESGNDVSVLPTCSRPGISSVDKSPLFSNSQTLNGSEGFLPCDPGNFGLASSDSVDVNCKVDTIQQRHRGRHRKQLVLSLNKCFLESGVESVDDTLALPTSRGPETLDVVESPLYGNSQDAMLLSNEAGCESSSKADLTSLIPRDIALPRVVLCLAHNGKVAWDVKWRPCTINDSEGMHHMGYLAVLLGNGSLEVWEVPAPSIVKVFFASSCSEGTDPRFLKLEPVFRCSKVKCGDRHSIPLTMEWSPSAPHDLILAGCHDGTVALWKFAKQYPSQDTKPLLCVTADSAPIRALAWAPEESDKESANLFVTAGHEGLKFWDLRDPYRPLWDLNPTPRAILSVDWVKHPRCIVLSLDDGTLRILSLWKAAYDVPVTGRPFAGTKYQGLHNFGCSSFAVWSAQVSRTLGLVAYCSADGSTVRFQVTELSDLTEAVDKDPKRNPKPHFLCGSLMEKGQVLEINSPLPDVPLPNIPFVQKKSVDDCVDTAPTMQLHGCLSDVDQAKQTGHAVSGSEETMGNTTSKSRKNERKKQHASAIAVQTKFHAEIEQGILQRNENKDEGSPQQFEAHPPKVVAMHRVRWNMNRGSERWLCYGGAAGIIRCQQVSLQM
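
The relationship between the CDS and Protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; the function name `translate` and the compact table layout are illustrative choices, not something taken from the source database.

```python
# Standard genetic code laid out in TCAG order (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop."""
    cds = cds.upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":  # stop codon terminates translation
            break
        protein.append(residue)
    return "".join(protein)


# First 18 nt and last 9 nt of the CDS listed in this record.
print(translate("ATGGGACAACTGCCTCGT"))  # MGQLPR
print(translate("CAAATGTAG"))           # QM
```

The two fragments reproduce the start (MGQLPR) and end (QM) of the listed protein, with TAG as the terminating stop codon.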